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We consider a minimalistic dynamic model of the idiotypic network of B-lymphocytes. A network 
node represents a population of B-lymphocytes of the same specificity (idiotype), which is encoded by 
a bitstring. The links of the network connect nodes with complementary and nearly complementary 
bitstrings, allowing for a few mismatches. A node is occupied if a lymphocyte clone of the corre- 
sponding idiotype exists, otherwise it is empty. There is a continuous influx of new B-lymphocytes 
of random idiotype from the bone marrow. B-lymphocytes are stimulated by cross-linking their 
receptors with complementary structures. If there are too many complementary structures, steric 
hindrance prevents cross-linking. Stimulated cells proliferate and secrete antibodies of the same 
idiotype as their receptors, unstimulated lymphocytes die. 

Depending on few parameters, the autonomous system evolves randomly towards patterns of 
highly organized architecture, where the nodes can be classified into groups according to their 
statistical properties. We observe and describe analytically the building principles of these patterns, 
which allow to calculate number and size of the node groups and the number of links between them. 
The architecture of all patterns observed so far in simulations can be explained this way. A tool for 
real-time pattern identification is proposed. 

PACS numbers: 87.18.-h, 87.18.Vf, 87.23.Kg, 87.85.Xd, 64.60.aq, 05.10.-a, 02.70.Rr 



I. INTRODUCTION 

B-lymphocytes play a crucial role in the adaptive 
immune system. They express Y-shaped receptor 
molecules, antibodies, on their surface. Antibodies have 
very specific binding sites (idiotopes) which determine 
their idiotype. All receptors of a given B-cell have the 
same idiotype. B-cells are stimulated to proliferate if 
their receptors are cross-linked by structures which are 
complementary to the idiotopes, e.g. by foreign antigen. 
Stimulated B-cells thus survive whereas unstimulated B- 
cells die, this process is called clonal selection 

After a few generations, stimulated B-cells differentiate 
to plasma cells which secrete soluble antibody molecules 
of the same idiotype, which may bind to complementary 
sites on antigen and mark them for further processing. 

B-lymphocytes of different idiotype are continuously 
produced in the bone marrow in a remarkable diversity. 
The variety of the potential idiotype repertoire, created 
by somatic reshuffling of gene segments and mutations 
[2}, was combinatorially estimated Q to exceed most 
likely 10 10 . 

Complementary structures may be found on antigen 
but could also be situated on other antibodies of com- 
plementary idiotype. B-lymphocytes can stimulate each 
other and, thus, they form a functional network, the id- 
iotypic network [J|. 

Jerne's concept of the idiotypic network explains in a 
natural way the diversity of the expressed idiotype reper- 



* ulrich.behn@itp.uni-lcipzig.de 



toire and the autonomous dynamics of an immune system 
not exposed to foreign antigen. It provides a mechanism 
of immunological memory. Imagine that an antigen Ag 
is recognized by an antibody Abi. Thus, the clone of Abi 
expands and possibly meets another clone of complemen- 
tary idiotype Abi. Both mutually stimulate each other 
and they persist even after Ag has disappeared. Abi is 
structurally similar to Ag and can be considered as inter- 
nal image of Ag. Furthermore, the network is thought to 
control autoreactive clones. All these issues are beyond 
the concept of clonal selection. 

The network paradigm got an immediate enthusias- 
tic response and idiotypic interactions were considered 
as the major regulating mechanism of the immune sys- 
tem. However, the rapid progress of molecular immunol- 
ogy, difficulties in the direct experimental verification, 
and the discovery of other regulating mechanisms let the 
interest of experimental immunologists decay. Yet, for 
system biologists the network paradigm always remained 
attractive. Several aspects of the original concept were 
revised in due course. Most notably, Varela and Coutinho 
[1-0] suggested second generation networks with an ar- 
chitecture comprising a strongly connected central part 
with autonomous dynamics and a sparsely connected pe- 
ripheral part for localized memory and adaptive immune 
response. In a sense, they reconcile both paradigms of id- 
iotypic networks and clonal selection. A readable history 
of immunological paradigms can be found in Q . Reviews 
with focus on idiotypic networks are [9( with an emphasis 
on modeling approaches and more recently [Io| with em- 
phasis on new immunological and clinical developments. 

Today the main activities are in clinical research. Idio- 
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typic interactions are the base of all therapies with mono- 



clonal antibodies 11] . New experimental techniques 



make large-scale studies of the idiotypic repertoire fea- 
sible [HI which are necessary to infer the networks archi- 
tecture. 

In this paper we consider a minimalistic model of the 
idiotypic network, which was first formulated and inves- 
tigated in [13]. In this model a node represents lympho- 
cytes and antibodies of a given idiotype. Lymphocytes of 
complementary idiotype can stimulate each other. The 
corresponding nodes are connected by links. 

Idiotypes are represented by bitstrings [13] of length d, 
bdbd-i ■ ■ ■ &i with bi G {0, 1}. Ideally, d is chosen such 
that 2 d is the size of the potential repertoire. The nodes 
of the network are labeled by these bitstrings. Two nodes 
v and w are linked if their bitstrings are complementary 
allowing for up to m mismatches, i.e. their Hamming 
distance is d#{y, w) d — m. The corresponding undi- 
rected graph is the base graph Gr™' . Each node has the 

same number of neighbors, k = X^fcLo (fe) • ^ represents 
the potential idiotypic repertoire with all possible inter- 
actions. 

Not all idiotypes are expressed in the real network. 
In our minimalistic model we only account whether an 
idiotype is present or not, correspondingly the node is 
occupied, n(v) = 1, or empty, n(v) = 0. The subgraph 
of gJ°' induced by the occupied nodes represents the 
expressed idiotypic network at a given time. 

We describe the temporal evolution in discrete time. 
The influx of new idiotypes from the bone marrow is 
modeled by occupying empty nodes with probability p 
[l5j |. For survival a B-lymphocyte needs stimulation by 
complementary structures. The dose-response curve is 
log-bell shaped, cf. [l6| and Refs. therein. If there 
are too many complementary structures, crosslinking be- 
comes less likely due to steric hindrance, the stimulus is 
reduced. In our model an occupied node survives if the 
number of occupied neighbors is between two thresholds, 
ti and tjj. The rules of parallel update are 

(i) Occupy empty nodes with probability p 

(ii) Count the number of occupied neighbors n(dv) of 
node v. If n(dv) is outside the window [t£,t(/] , set 
the node v empty 

(hi) Iterate. 

Driven by the random influx of new idiotypes the net- 
work evolves towards a quasi-stationary state of non- 
trivial, functional architecture in which groups of nodes 
can be identified according to their statistical proper- 
ties. Crucial for that is besides the random occupation 
of empty nodes, that occupied nodes are emptied if linked 
with too few or too many occupied nodes. 

The paper is organized as follows. In the next section 
we discuss the model in its scientific context in other 
disciplines and its relation to other models of idiotypic 
networks. In Sec. Mil we provide simulation results. We 
sketch a typical random evolution of the system to make 



the reader familiar with the systems behavior. Consid- 
ering global and local network characteristics the exis- 
tence of groups of nodes which share statistical proper- 
ties [l3[ is confirmed and more details are revealed. In 
Sec. IIVI we describe certain regularities in the bitstrings 
of nodes which belong to the same group. The observed 
patterns can be constructed from pattern modules [171 ]. 
The construction principle is explained hrst for the sim- 
plest pattern and generalized afterwards. These findings 
allow to calculate the number of groups, the group sizes, 
and the linking between groups. A new observable, the 
center of mass, is introduced that proves very useful in 
real-time pattern identification. In Sec. [V]we apply this 
concept considering specific patterns observed in simula- 
tions, among them a dynamic pattern with core groups, 
peripheral groups, stable holes, and singletons that re- 
sembles in some aspects the biological network [ij Q . In 
the appendix we calculate the scaling of the relative size 
of these groups for systems of biological size. 



II. CONTEXT AND RELATED MODELS 

The topic falls into several scientific disciplines. It 
is natural to place our investigations in the context of 
network theory. Network theory has applications in a 
plethora of different, multidisciplinary fields [l8l - l2C| and 
has received great attention in the community of statis- 
tical physicists in the last decade. 

A major body of research deals with growing networks, 
where new nodes are attached to the existing nodes ran- 
domly, or depending on properties of the existing nodes. 
In this context deletion of nodes is only considered to 
study the resilience of the network against random or 
targeted attacks [2l|. For recent reviews see f22T(2~4j ]. cf. 
also [II [H. 

Natural networks however do not grow without limit 
but stay finite and evolve towards a functional archi- 
tecture. There are several modes to enable evolution: 
(i) Adding nodes but keeping the growth balanced by 
deletion [29 - [3l| or merging of nodes [32H34 ] . (ii) Keeping 
the nodes unaffected but add, delete, or reorganize the 
links [35l - l40l. (in) Addition and deletion of both nodes 
and links [4l|-|47l|. Again, this can be done randomly or 
depending on the properties of the nodes and its neigh- 
bors. 

Generic observables to characterize networks include 
the degree distribution, centrality, betweenness, cliquish- 
ness, modularity, clustering coefficients, and diameter. 
For instance, growing networks using preferential attach- 
ment have a power law degree distribution like many real 
world networks. However, the characteristic exponent of 
preferential attachment networks is larger than the one 
found in natural networks. This was a major motivation 
to study evolving networks. 

In many natural networks nodes have individual prop- 
erties which control their potential linking. Clearly, in 
our case this is the idiotype. Also protein networks, tran- 
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scription networks and generally signaling networks be- 
long to this class. 

Nodes may have an internal state which can change 
depending on their neighborhood in the network, or on 
external influences. This dynamics has a typical time 
scale that is shorter than the time scale for evolution of 
the network's architecture. The interplay of these pro- 
cesses came into the focus of research only in the last 
few years. For a recent review and a status report see 
[HI HH . Our model is a very early example where this 
interplay is studied [13j . In the present paper we describe 
the building principles of the architecture. 

Our network model is a Boolean network [5(|, since 
each node can be only in one of two states, empty or oc- 
cupied. The nodes are updated in consecutive time steps 
depending on its own and its neighborhood occupation. 

The model is also a cellular automaton, for a compre- 
hensive monograph see (5l| . More precisely, since the 
update depends only on the sum of the neighbor states 
and the state of the node itself, it is a totalistic cellular 
automaton. Cellular automata naturally involve unoccu- 
pied nodes. In our model unoccupied nodes, holes, play 
an important role. We distinguish holes that could be oc- 
cupied from stable holes which can not be occupied due 
to overstimulation. A simple network picture disregards 
nodes which are not occupied. 

A famous example of a totalistic two-dimensional de- 
terministic cellular automaton is Conway's Game of Life 
52] . Both, the survival of an occupied cell and the occu- 
pation of an empty cell, are governed by window rules. 
Many interesting patterns, depending on the initial con- 
ditions static or dynamic, have been described in detail 
and classified. The Game of Life was transfered to a va- 
riety of lattices, e.g. to three dimensional cubic lattices 
[53|, [54|, to triangular, pentagonal and hexagonal tessela- 
tions^to Penrose tilings [54145611 and to small world geom- 
Larger than Life 58-|60| increases the radius 
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of the neighborhood. Also, a probabilistic version on a 
2d square lattice has been proposed, where stochastic 
deviations from the deterministic update are permitted 
6 if . The mean occupation of cells, a global order pa- 
rameter, undergoes a sharp phase transition for increas- 
ing strength of stochasticity. On the occasion of the 40th 
anniversary appeared a comprehensive collection of re- 
cent results on the Game of Life and its descendants [HJ . 
Our model can be considered as a further version of the 
Game of Life on a high dimensional graph, where empty 
nodes are randomly occupied, while the survival of occu- 
pied nodes is governed by a deterministic window rule. 
Starting from an empty graph we observe an evolution 
toward a complex, highly organized architecture. 

The model can also be categorized as a stochastic, non- 
linear dynamical system. 

There exists a variety of models for B-cell networks. 
References 0, [l6| give comprehensive surveys of mod- 
elling approaches. 

For instance, Stewart and Varela [g3|, |64| proposed 
a model, which also has a random influx and a win- 



dow update rule to simulate the internal dynamics and 
a zero/one clone population. However, while we con- 
sider a discrete d-dimensional hypercubic shape space, in 
their model the complementary idiotypes live on different 
sheets of a 2D continuous shape space [65[ . A summary 
of results obtained in models with continuous shape space 
is given by Bersini [66] . Several aspects of modeling in 
continuous and discrete shape space are discussed in [67 1. 

An early network model inspired by spin glass physics 
was proposed by Parisi [6g| to describe immunological 
memory. The interaction between idiotypes in the model 
of Barra and collaborators [69l - l7l| is also taken from spin 
glass physics. Their model describes a given number of 
idiotype populations each with a constant number of lym- 
phocytes. Each cell can be in a firing or quiescent state. 
The strength of the ferromagnetic coupling between the 
idiotypes (also encoded by bitstrings) models the affinity, 
which is related to the complementarity of the bitstrings. 
Barra and Agliari [69|, [7QJ compute the degree distribu- 
tion, the type and number of loops, and consider the 
scaling behavior. They describe primary and secondary 
immune responses and understand low and high dose tol- 
erance as a network phenomenon. In our model the log- 
bell-shaped response is integrated as a B-cell property 
6, 16:]. Essential for our approach is a random influx of 
new cells and a selection mechanism. Hence, we can de- 
scribe the evolution of the network towards a functional 
architecture. 

A model which distinguishes between antibody 
molecules and lymphocytes with a dynamics including 
an influx similar to ours is proposed by Ribeiro et al. 
[72I ]. They compare the dynamics on given architectures 
of random and scale- free networks. 

IMMSIM, invented by Celada and Seiden [zl, 0|, is 
also a modified cellular automaton. It incorporates many 
immunologic agents, including antigen presenting cells, 
B-cells, T-cells, antigens, antibodies and cytokines. It 
describes both humoral and cellular responses. Idiotypes 
are also characterized by bitstrings. The vast amount 
of interacting agents increases drastically the number of 
parameters. The model is intended to be as realistic as 
possible and to provide experimental and practical im- 
munologists with a tool to test hypotheses in silico. Our 
approach is in a sense complementary. We aim at an un- 
derstanding of the principles governing the autonomous 
evolution toward a functional architecture. Therefore we 
investigate a minimalistic model with a small number of 
parameters, which nevertheless exhibits essential features 
of the biological network. 

The concept of idiotypic networks inspired applications 
in computer sciences, e.g. artificial immune systems for 
the detection of intrusion, of spam or viruses [7^, [76[ . 



III. SIMULATION RESULTS 



In this section we report on simulations on the base 
graph G^2i which consists of 2 12 = 4096 nodes, each of 
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which has k = 79 links to other nodes. The window rule 
parameters are [*£,,%] = [1; 10] • The lower threshold is 
biologically motivated, a node needs at least one occu- 
pied neighbor to survive. The upper threshold is chosen 
to enable a non-trivial, dynamic pattern of complex ar- 
chitecture. As discussed below in more detail, higher tjj 
would allow a broader variety of static patterns. How- 
ever, the concept developed in this paper still applies. 

The simulations start with an empty base graph. The 
influx p varies from to 0.11. This covers the range in 
which we find interesting patterns, above p jj, tu/n — 
0.127 there is only trivial random behavior. 



system size such catastrophes become less likely. In the 
thermodynamic limit we expect a breaking of ergodicity. 

In our finite system there can be transitions between 
different patterns on a route without extinction catastro- 
phe but via the formation of an intermediate, unstable 
giant cluster, for more details see (l3j . 

Modeling biological systems we should keep in mind 
that they are finite and have a finite life expectation. 
The quasi-stationary state could persist for times longer 
than the life span of the individual but transitions be- 
tween different states can not be excluded. Moreover, 
the parameters could vary during the individual's life. 



Random Evolution 



B. Global Characteristics 



We start the simulations with an empty base graph 
occupying nodes with probability p. In the first time 
step only those nodes survive which have at least one 
occupied neighbor (having more than tjj occupied neigh- 
bors is unlikely in the beginning). The surviving nodes 
represent seeds the neighbors of which will survive if oc- 
cupied. Hence, we observe a rapid growth of a giant 
cluster until more and more nodes have more than tjj oc- 
cupied neighbors. Exceeding the upper threshold deletes 
a node. Thus, the giant cluster decays and many stable 
holes are created, i.e. nodes with the number of occupied 
neighbors above the upper threshold tjj. Fig. Q] shows a 
time series for the evolution to a stationary state where 
the number of stable holes increases up to its stationary 
value. 

The empty base graph is a highly symmetric object. 
Due to the random influx the symmetry is broken and 
the system falls into a network configuration of lower 
symmetry. The typical result of the evolution is a quasi- 
stationary pattern of mutually dependent occupied nodes 
and stable holes. Properly situated occupied nodes create 
stable holes and in return stable holes in the neighbor- 
hood of an occupied node may prevent its overstimula- 
tion. 

In the generic case for a given p a certain pattern type 
is found most frequently. Occasionally, depending on the 
history of the driving process, also other patterns occur. 
Once established they all can live for a long time. (This 
can be proved in simulations preparing the pattern as 
initial configuration.) 

In principle, the system is ergodic [77]]. This becomes 
immediately clear considering the following unlikely but 
possible event. For any choice of p > the occupation 
of nodes by the influx can be such that the application 
of the window rule leads to an empty base graph. After 
this extinction catastrophe the further realization of the 
driving process, the influx, determines to which pattern 
the system evolves. For any p > an infinite trajectory 
contains partial trajectories leading to any possible pat- 
tern. In practice, in our simulations we have never seen 
such an extinction catastrophe but only quasi-stationary 
patterns usually living for a long time. For increasing 



A first characterization of the different patterns can be 
obtained considering global quantities. They include the 
number of occupied nodes on the base graph 



i(G) = 



(1) 



vEG 



where n(v) € {0, 1} is the occupation of node v, the size 
of the largest cluster in the set of present clusters C 



|C max |=max(|C|), 



and the average size of the clusters 



(2) 



(3) 



where Nq is the current number of clusters, and ( • )g 
denotes the average over the elements of some set S. 
Clusters, i.e. connected parts of the occupied subgraph, 
are very characteristic for patterns. Finally, we mention 
the number of stable holes h*(G), i.e. empty nodes with 
n(dv) > tjj. 

Figure [T] shows a generic time series of these global 
characteristics for a parameter setting where the system 
evolves to a stationary 8-cluster pattern. 

In the stationary state we can consider temporally av- 
eraged global quantities. The temporal average is defined 
as 



1 



Ti -Tn 



E 

teCTo.Ti] 



(4) 



where T should be larger than the relaxation time 
in which the system reaches the stationary state, and 
(To,Ti] is the averaging period. 

Table U gives results for three patterns which occur 
for different influx p. The static patterns are named ac- 
cording to the size of the characteristic clusters. With a 
deeper understanding of the architecture we shall name 
them by the number of groups of nodes, see Sec. IIVI be- 
low. 
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FIG. 1. Time series of the number of occupied nodes n(G), 
the size of the currently largest cluster |C max |, the average 
cluster size (|C|)c, and the number of stable holes h*(G) on 
the base graph G$ with [t L ,tu] = [1, 10] and p = 0.01. The 
system evolves to a stationary 8-cluster pattern. 



TABLE I. Temporal averages and standard deviations of 
global characteristics for three typical patterns. Data from 
500,000 iterations. 



Pattern 


8-cluster 


2-cluster 


dynamic 


P 


0.005 


0.015 


0.025 


n(G) 


1024.6 ± 0.8 


1023.3 ± 0.9 


533.4 ± 27.7 


h*(G) 


2816.0 ± 0.0 


3055 ± 20 


1828 ± 114 


|C max | 


8.0 ± 2.8 


4.5 ± 2.1 


410 ± 20 


(|C|) C 


7.96 ± 0.04 


2.01 ± 0.01 


4.49 ± 0.96 


N c 


128.6 ± 0.8 


510.4 ± 3.7 


123.1 ± 22.0 



The 8-cluster pattern at p = 0.005 has 128 clusters of 
size 8, which results in a total average population of 1024 
occupied nodes, i.e. one fourth of all nodes. The remain- 
ing nodes are holes, 2816ofwhich are stable (n(dv)>tjj), 
and 256 are not stable (n(dv) ^tjj). Empty nodes with 
n{dv) < tr, are not counted as stable holes, since they 
could easily become occupied and sustained by the ran- 
dom influx. Figure [T] shows the temporal evolution from 
an empty base graph towards an 8-cluster pattern. 

In the 2-cluster pattern at p = 0.015 the 510 clusters of 
size 2 together occupy about one fourth of the base graph. 
The remaining three quarters are stable holes. Defects in 
the perfect 2-cluster pattern allow that occasionally some 
of the node pairs become connected via a central hub and 
a larger star-shaped cluster is formed, see Fig. [2] t\j = 10 
was chosen to exclude the occupation of the hub in the 
perfect pattern. Both the 8- and the 2-cluster patterns 
are quasi-static, the temporal fluctuations are small. 

A more complex, dynamic pattern evolves for larger 
p <J 0.03. The standard deviation of the temporal av- 
erages is one order of magnitude larger than in the two 
static patterns. Figure [3] shows a snapshot of the occu- 
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FIG. 2. A selection of 2-clusters as they appear for moder- 
ate influx. We see ten 2-clusters of occupied nodes (black). 
They are all connected through a hub (gray). If the eleventh 
2-cluster (empty circles) becomes occupied, the hub would 
not survive the next update. There is one link (solid) with 
one mismatch, the other links (dashed) have two mismatches. 
The nodes are labeled with the decimal expressions of their 
bitstrings. Figure produced using yEd [781 ]. 



pied graph. We see one large cluster of about 400 nodes 
and about 120 isolated nodes. These nodes had at least 
one occupied neighbor which was removed in latest up- 
date. Further, approximately 1800 stable holes and 1800 
unstable holes have been observed, which are not shown 
in the figure. We can clearly distinguish a central and a 
peripheral part, which are supposed to be of functional 
importance in the biological idiotypic network. 



C. Local Characteristics 

Besides global quantities, time averages of local quan- 
tities characterizing every single node can be considered. 
Elucidating are the mean occupation n(v), the number of 
occupied neighbors n(dv), and the mean life time t(v), 
which is defined as 



t(v) 



1 



b(v) + n To 



E 

t€(T ,Ti] 



n t (v) . 



(5) 



where b(v) is the number of births during the observation 
time, i.e. the number of new occupations of the node by 
the influx. Of course, b(v) + tit 7^ must be fulfilled, 
otherwise t(v) has no meaning. 

We can identify groups of nodes sharing statistical 
properties as proposed in |13j . FigureUshows mean occu- 
pation, mean life time, and the number of occupied neigh- 
bors vs. influx probability. The groups appear as peaks 
in the histograms. The number of occupied neighbors 
proves most suitable to distinguish the different groups. 

For small and moderate influx a clear group struc- 
ture is visible. The data for the mean life time and the 
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FIG. 3. (Color online) Snapshot of the occupied graph of 
the complex configuration of a dynamic pattern for p — 0.025. 
The nodes are colored according to their membership in dif- 
ferent groups, see Fig. [15] (top). Nodes within a group have 
similar statistical characteristics, see text. Figure produced 
using yEd [z||. 



mean occupation suggest that the patterns are static for 
p £ 0.03. These patterns have groups of occupied nodes 
with a high mean life time, other groups are stable holes 
or sparsely occupied nodes. For 0.03 iS p J$ 0.08 the 
patterns are dynamic, but still stationary. Also in dy- 
namic patterns there are stable holes. The mean life 
time of occupied nodes is small, the occupied subgraph 
changes continuously. While in static and dynamic pat- 
terns all nodes remain in their groups, for high influx 
p 0.08 the patterns become short-lived, groups dis- 
solve and reemerge in a different configuration. Their 
distinction by temporal averages becomes more and more 
difficult. For very high influx p 0.12 the dynamics is 
entirely random. 



IV. CONCEPT OF PATTERN MODULES 

We find regularities in the bitstrings encoding nodes 
belonging to the same group. This allows to identify 
general building principles. Patterns are build from more 
elementary objects, so called pattern modules. With this 
concept we can derive all structural properties, the size 
of the groups and their linking, of the patterns observed 
so far in the simulations. 

We first explain the principles on the example of the 
2-cluster pattern, and develop then the detailed concept 
for the general case. At the end of the section, exploiting 
this concept we introduce the center of mass vector, a 
tool which allows to identify in simulations the majority 
of patterns in real time. 
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FIG. 4. Time averages of the mean occupation (top), the 
mean life time (center), and the mean number of occupied 
neighbors (bottom) of each node for increasing values of p, 
Ap = 5/4096. Each graph shows isolines of a histogram in 
top view. The grey scale reflects the number of nodes with 
the corresponding property. Regimes of different temporal 
behavior are separated by vertical lines. For each p data is 
from time series of 500,000 iterations in the stationary state, 



(2) 



starting from an empty base graph G 



A. Simplest Case: The 2-Cluster Pattern 

We introduce the building principles of patterns con- 
sidering the simplest pattern, the 2-cluster pattern which 
appears for moderate influx. In simulations starting 
from empty base graphs it is rare and only found for 
< p iS 0.03, but if prepared as initial condition it is 



very stable for a larger range of p up to 0.045. By their 
statistical characteristics, cf. Table HH we can distin- 
guish three groups of nodes, frequently occupied nodes 
(Si) with a high mean life time, permanently empty sta- 
ble holes (S3), and rarely occupied potential hubs (S2) 
which link together up to tu 2-clusters if occupied. 



TABLE II. Local characteristics of the three groups in the 
2-chister pattern for p — 0.025 compared with the ideal 
pattern. Data from a time series of 500,000 iterations. 







Si 


s 2 


S3 


mean occupation 


(n(v)) Si 

Wideal 


0.993 
1 


0.0004 



0.000 



occ. neighbors 


(jt(dv))si 

n(9«)idcal 


1.002 
1 


10.95 
11 


55.64 
56 


mean life time 


(~( v ))s, 


6923 


0.016 


0.000 



Looking at the node indices i v in decimal representa- 
tion we observed that the sum of the two indices in a 
2-cluster is constant in a realization. In a different real- 
ization the index sum can be different. For instance, in 
Fig. [5] the index sum within all 2-clusters is 6207. This 
indicates regularities at the level of the bitstrings. 

We found that all occupied nodes are identical in ex- 
actly two bits, say at position k and I . The members of a 
2-cluster are complementary in all other bits, in symbols 
we write 

• • is linked with • • • • • , 

where the bar denotes the bit inversion. The bitstrings 
of all stable holes are also equal in the same two bit po- 
sitions k and / . However, they are inverse to bk and bi 
of the occupied nodes. Potential hubs have exactly one 
inverse and one equal bit in these positions. As these bits 
play a crucial role, we call them determinant bits. The 
regularities are summarized by 

occupied nodes Si • ■ -bk — &r • ■ 

potential hubs S2 < ^ k — 

[ ■ ■ -b k - ■ -b r ■ ■ 

stable holes S3 . 

The example in Fig. [2] has the determinant bits in posi- 
tions 7 and 12, 67 = bi2 = 1. 

This allows to explain all structural properties of the 
pattern observed in the simulations. We can construct 
an ideal 2-cluster pattern, a configuration in which all 
nodes of group Si are occupied and the others remain 
empty. It is ideal in the sense that there are no defects 
but also no hubs. 

Since all other bits can take all possible combinations, 
the size of the groups can be calculated, e.g. there are 
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2-Cluster 




Stable holes 



FIG. 5. The three groups of the 2-cluster pattern and their 
linking. The circle sizes correspond to the group sizes. The ar- 
rows and the numbers next to them indicate how many nodes 
of a group exert influence on the nodes of another group, e.g. 
each node of S2 is stimulated by 11 nodes from Si. The num- 
ber of links can be counted in simulations or taken from the 
link matrix that is derived in Sec. II V BI 

I Si I = 2 d ~ 2 occupied nodes and IS2I = 2 x 2 d ~ 2 potential 
hubs. 

We further can compute the number of occupied neigh- 
bors n(dv) of a node v of any group. Since all nodes of 
Si are occupied in the ideal pattern, n(dv) is given by 
the number of links between v and nodes in Si. A link 
between two nodes exists if their bitstrings are comple- 
mentary except for up to two mismatches. If v G Si , 
it has two bits in common with all other nodes in Si, 
namely bk and 6; . Thus, all remaining bits must be ex- 
actly complementary. There is only one node w G Si, 
which obeys this constraint. If v € S2 or v € S3, there is 
one pre-detcrmincd mismatch or none, respectively. The 
remaining mismatches can be distributed among the d— 2 
non-determinant bits. Thus a node in Si has 




occupied neighbors in the ideal pattern. For small influx 
this is in good agreement with the simulations, cf. Table 
HT1 In a similar way for all nodes the number of links to 
nodes in different groups can be calculated, the result is 
visualized in Fig. [5] The derivation for the general case 
is given in the next subsection. 

This regularity encouraged the following concept. Con- 
sidering the two determinant bits as coordinates of a 
two-dimensional space, they define the corners of a two- 
dimensional hypercube. The corner with coordinates 
(bk,bi) represents an occupied node, the opposite cor- 
ner (bk, bi) is a stable hole, and the neighboring corners 
of (bk,bi) are potential hubs. We call this structure a 
pattern module, because it is the building block for the 
entire regular configuration. In a different picture, we 
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module B module B 



FIG. 6. Two pattern modules with complementary non- 
determinant bit chains B and B, respectively, on a two mis- 
match base graph with a 2-cluster configuration. The two- 
dimensional modules are congruently occupied, each consist- 
ing of one occupied node (black, -00), two potential hubs 
(gray, -01 and -10) and one stable hole (white, -11). The 
positions and values of the determinant bits are chosen with- 
out loss of generality. The links have no mismatch (bold), 
one (solid), or two mismatches (dashed). There are 2 d ~ 2 pat- 
tern modules, each pair of which with complementary non- 
determinant bits contribute a pair of occupied nodes. 



can also understand an ideal configuration as consisting 
of 2 d_2 congruently occupied 'parallel worlds'. Figure [5] 
illustrates the concept of pattern modules. 

Any choice of the two determining bits is of course 
possible, all corresponding patterns are equivalent, the 
2-cluster pattern is 2 2 x ( 2 )-fold degenerated, where the 
first factor represents the choice of the two determinant 
bits, and the second factor gives the number of possible 
positions of these bits in the bitstring of length d . It is 
the individual history (the realization of the random in- 
flux) , which selects the determining bits and thus breaks 
the symmetry. 



B. General Case 

1 . Groups 

(2) 

Many results for 2-cluster patterns on the G 12 base 
graph can be generalized to more complex architectures 
and other choices of d and m. This includes the 8-cluster 
pattern mentioned in Sec. IIIII and other static patterns 
as well as the dynamic pattern. Their structure is cor- 
rectly described by pattern modules with more than two 
determinant bits. 

In the same way as for the 2-cluster pattern, we de- 
fine the pattern module as a hypercube of dimension d^f , 
where du is the number of determinant bits. There are 
two groups which are represented by only one node in the 



pattern module. One of them is labelled Si. All nodes in 
group Si have the same determinant bits b\ ■ ■ ■ bd M . The 
other groups are ordered such that the determinant bits 
of Sj differ in j — 1 positions from the determinant bits 
of Si. It is clear that there are djvf + 1 groups. Groups 
Sj and Sd M +2-j are equivalent, they have the same size 
and linking properties, see below. In typical patterns, 
occupation breaks the symmetry. We usually label the 
smallest group of the more occupied half as Si , which is 
of course arbitrary. 

The size of the groups can be obtained by elementary 
combinatorics. Since, the i-th group deviates in i—1 out 
of dj\t determinant bit positions of Si, there are (^) 
choices. This number is multiplied by the number of 
pattern modules on the base graph given by 2 d ~^ M . The 
group size is 

\Si\=2 d - d ^f^Y i = l,...,d M + l. (7) 

The factor L^) is called the relative group size, since it 
is the group size normalized by the size of Si. It is inde- 
pendent of the base graph dimension d and the number 
of mismatches m. 

As an example, we can construct 2-cluster patterns on 
a base graph G^ by means of pattern modules with 
exactly one occupied node (Si). The dimension of the 
pattern module c?m then has to equal the number of al- 
lowed mismatches m. The number of qualitatively dis- 
tinguishable groups is rn-t-l, etc. A 2-cluster pattern can 
emerge if the lower threshold is ^ 1 and the upper 
threshold obeys 1 ^ tjj ^ d — m . The 2-cluster pattern 
on 1-mismatch graphs described in [l3j is an instance of 
such a pattern. However, in the 1-mismatch case one half 
of all nodes are occupied, the other half are stable holes. 

2. Linking 

Each node on has k = X^feLo (t) links , k is con- 

strained by the allowed number of mismatches m. We 
consider a pattern with dyi determinant bits. Each node 
in group Si is linked to Lij neighbors in group Sj. The 
Lij are the entries of the link matrix L. L defines the 
architecture of a pattern built of modules of dimension 
d M - 

The dynamics of a node depends on the number of 
occupied neighbors. The mean occupation is a typical 
common property of nodes belonging to the same group. 
Thus, the knowledge of the group membership of the 
node's neighbors is of crucial importance to understand 
its statistical properties. 

Within the concept of pattern modules L can be de- 
rived combinatorially. Recall that the determinant bit- 
string of a node in Si deviates in I — 1 bits from the de- 
terminant bitstring of a node v\ in Si . In the following 
we consider a node «W chosen such that the i—1 bits 
inverse to the corresponding bits of v\ are left-aligned, 
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its non-determinant bits are denoted by B. This choice 
is without loss of generality, because all the arguments 
do not depend on the labeling of the bit position. From 
the nodes in Sj we choose such that the j — 1 bits 
inverse to the corresponding bits of v\ are right-aligned 
and the non-determinant bits are B. There is no other 
node in Sj with less mismatches to »W. 

We have to distinguish whether or not the partial bit- 
strings of length i — 1 and j — 1 do overlap. These cases 
are discriminated by the value of Ay = g?m — i — J + 2. 
There are three cases. 

(i) Ay =0. All determinant bits of yW and y"J are com- 
plementary. 

(ii) Ay >0. «W and share Ay determinant bits with 
v\. This case is illustrated below. 



''1 


b! ... b ( 


_i ^ . . . 


bd M -jH 




-if 2 ■•• 




w «) 


6i ... b, 


-l bi • ■ • 


bd M -j+i 




-if 2 ••• 






6i .. bi 


-l bi . . . 






-if 2 ••• 


bd M 




v » 

2—1 bits 


Ay=d M 


— i— bits 




j — 1 bits 





(hi) Ay < 0. yW and share |Ay| determinant bits 
which are inverse to the corresponding bits of vi, see 
diagram below. 



bi 



yW "b7 

yO') bi 



bd Jf -j+i b d 



-J+2 



bdj, rt +i b dM -ji-2 ■ ■ ■ 6j-i bi 
bd flf -j+i bd M -j+2 ••• bi_i b, ; 



-i-\-j — djvf — 2 bits 



-i+1 bits 



2—1 bits 



The number of mismatches between yW and y'- 7 ' is 
| Ay | . If | A^ | ^ m there is a link between andy*- 7 ^. In 
this case there are further nodes in Sj which link to 
the number of which is calculated combinatorially. There 
are to— |Ay| additionally allowed mismatches which can 
appear among non-determinant and/or determinant bits. 

In cases (ii) and (hi) there are further nodes in Sj with 
| Ajj | mismatches to y W . These are obtained by distribut- 
ing the | Ay | mismatches among the 6lm— *+l right-aligned 
bits in case (ii), or among the % — 1 left-aligned bits in 
case (iii). This leads to ( di A ~* fl ) and (i^ 1 !) nodes in the 
respective cases. 

Now we consider additional mismatches. Among the 
d — d,M non-determinant bits we can distribute I mis- 
matches in 



d-d M 
I 



different ways. 

Among determinant bits additional mismatches can 
only appear in pairs. This is relevant for m— |Ay| ^ 2. If 
we invert one bit in yw, the resulting node is not in Sj 



since the number of bits complementary to v\ is changed. 
We have to invert a second bit so that the number of mis- 
matches with vi remains constant. The number of nodes 
in Sj with |Ay|+2fc mismatches to is computed as 
follows. 

In case (ii) we invert k bits in the i — 1 left-aligned bits 
ofy( J ). There are ('T ) possibilities. To compensate this, 
we have to invert k bits among the j — 1 right-aligned bits 
of ijw, Altogether, there are now Ay + fc mismatches to 
yW living on the d*/ — i+1 right- aligned bits. There are 
( d A^+k) possibilities to distribute them. Thus we have 



i-1 
k 



A,>, 



■i+1 
+ k 



nodes in Sj with Ay 



2k mismatches to yW. 
Case (iii) is similar. We invert k bits in the g?m —i + l 
right-aligned bits of there are ( M ~^ +1 ) possibilities. 
To compensate this, we have to invert k bits among the 
d>M — j + 1 left-aligned bits of v^'. Together, Ay + k 
mismatches to yW live on the i 
are 



T left-aligned bits. There 



I A* -|+fc) possibilities. This leads to 
dM — i+l\{ i - 1 

k A|Ay|+fc 

nodes in Sj with |Ay| + 2k mismatches to yW. 

The result in case (i) is obtained setting Ay = 0, and 
the result for redistributing only |Ay| mismatches de- 
rived before is reproduced for k = 0. 

Since the total number of mismatches between linked 
nodes can not exceed m, I and k are constrained by I + 
2k + | Ay | ^5 m. Summarizing we obtain for Ay 



Lk=0 



d-d M 
I 



i — l\ / du~ i+1 
k )\ +k 



x 1 (Z + 2fc+Ay m) 



(8) 



and for Aij ^ 



E 



v ^ v l 

Lk=0 



d-d 



M 



i-1 
Ay | +k 



d M -i + l 
k 



x a (Z+2fc+|Ay| s$ to) 



(9) 



In some cases the evaluation of the sums in Eqs. O 
leads to simple rules, see [7§|. 

Of course, the total number of links of a node is 



The link matrix L = (Ly) has a symmetry given by 



(10) 



Lij — L 



dM+2-i,dM+2-j 



(11) 



If the link matrix entries of the zth row are multiplied 
by the size of group Si , the resulting matrix with entries 



A; 



| Si | Lij additionally obeys Ay = Aji. 
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FIG. 7. The general structure of the (du + 1) X (<2m + 1) 
link matrix. Matrix entries along the 2m + 1 solid diagonal 
lines are non-zero, cf. Eqs. (JHIO- All other entries are zero. 
Groups in the center {gray square) couple to themselves and 
are naturally called core groups. Among the groups without 
self-coupling we can further distinguish groups that couple to 
the core from those which do not. See further discussion in 
the text. 



It may be of interest to know the number of links from 
a node of Si to nodes of Sj with a given number of mis- 
matches which is denoted by Lf-. It is obtained re- 



placing in Eqs. dHJE) l(/+2fc+|A. y | < m) by S mh}]Ai 
which gives, e.g. for Aj,- ^ 



T 1 ' 



E 



d - d M 
H-2k-A i:j 



i-1 

k 



A i:j +k 



(12) 



Obviously, it holds L,j = ^!T=o L\ 



3. Architecture 

For the link matrix and, thus, for the architecture of 
patterns we can state a number of general properties. 
Because |Ay | = \du — i — i + 2| ^ m is the necessary 
and sufficient condition for the existence of links between 
groups i and j, all non-zero matrix entries are situated on 
a band along the secondary diagonal, see the illustration 
of the general structure of the link matrix in Fig. [7] The 
width of this band is 2m + 1, i.e. m controls the range of 
influence of a group. Group Si interacts with the 2m + 1 

groups Sd M +2-i-m i ■ • ■ j Sd M +2-i+m- 

We always find groups with self-coupling, because 
there is always a quadratic block of non-zero matrix en- 
tries in the center of L. Such groups are called core 
groups. The size of the core, i.e. the number of core 
groups, depends on m and on the existence of a central 



2 (3) 




6 (5) 




FIG. 8. Groups and their linking for even (above), and odd 
dM (below) on a two-mismatch base graph. The lines show 
possible links between nodes of the groups. The circles indi- 
cate the self-coupling within the core groups. It is suggestive 
to call arrangement of the groups caterpillar representation. 



element in L. Such an element exists if c?m is an even 
number. Thus, 



#core groups 



m 
m+1 



if either m or dM odd 



otherwise 



(13) 



Groups without self-coupling exist if the number of 
groups dM + 1 is larger than the number of core groups. 
Such groups are related to the square submatrix of L 
with all entries equal to zero, cf. Fig. [71 

The number of core groups for a given m only depends 
on whether dju is even or odd but is independent on d. 
We can represent the content of Fig. [7] in an alternative 
way exploiting that groups with index i have the same 
properties as those with index dM + 2 — i. In Fig. [5] 
these groups are on the same position in the two par- 
allel strands. The core groups are on the right end of 
the diagram. We distinguish two cases, with even and 
odd number of core groups, respectively, which form the 
"head" of a caterpillar. If we increase dM by two, this 
does not change, but only the "tail" of the caterpillar 
gains an additional segment. This observation can be 
used to determine the scaling of the group sizes for large 
d, cf. the appendix of this paper. 

Note, that all these general properties in this section 
are structural information about the patterns. They only 
depend on the parameters d, m and on the choice of a 
pattern module dM- 



4- Center of Mass 

In the simulations time series of 2 d nodes are gener- 
ated, e.g. to calculate the mean occupation. In or- 
der to identify patterns in real time it has proven use- 
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FIG. 9. (Color online) A typical time series of the center of 
mass vector components, here on with [ij,t«] = [1,10] 
for p = 0.01. The evolution starts from an empty base 
graph, which is gradually occupied thus breaking the sym- 
metry. A stationary state is reached after about 1300 time 
steps. Four components fluctuate around non-zero mean val- 
ues, .Ri, i?7, .Rio « —0.5, and Ru ~ 0.5. Hence, as explained 
in the text, it is an architecture with dtt = 4 determinant 

bits and all nodes in Si have -10 • -0 0. The time series 

of global quantities in Fig. [T] describes a different realization, 
which also evolves to a dhi = 4 pattern. 



ful to reduce this information by introducing - in anal- 
ogy to classical mechanics - a center of mass vector in 
dimension d. We consider occupied nodes, n(v) = 1, 
as unit point masses in a d-dimensional space [— 1, 1] . 
The position vector r(v) of a node v encoded by the 
bit chain bdbd-i ■ ■ ■ foi with bi g {0, 1} has components 
r,; (v) = 2bi — 1 in this space. The center of mass is defined 
as 



R 



1 



n(G) 



n(v)r(v) 



(14) 



The definition of r(v) ensures symmetry with respect to 
r = 0, which implies that for any symmetrically occupied 
pattern, e.g. the completely occupied base graph, we 
have R = 0. 

Figure [9] is a typical example of a time series of the 
d components of R. Stationary states are characterized 
by small fluctuations of the components Ri around some 
average values Ri, think e.g. of a moving average. The 
value of Ri allows in the typical case to decide whether 
or not i is a determinant bit position. 

In general, for all non-determinant positions i we have 
Ri ~ 0. For any choice of determinant bits, the non- 
determinant bits run through all combinations of zeros 
and ones. Therefore, supposed that all nodes within a 
group are occupied with the same probability, the ex- 
pected contribution of each group to the non-determinant 
components of R is zero. 

For patterns which break the symmetry, the Ri for 
determinant bit positions i are non-zero, positive or neg- 
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FIG. 10. For a pattern with dnt = 4 we arrange the 4 determi- 
nant bits as they contribute to the 5 groups Si,...,Ss for the 
case that the determinant bits of Si are 1000, corresponding 
to Fig. H 



ative. We explain in the following example, what can be 
inferred from this information. 

We consider a pattern with c?m = 4. Fig. [TU] shows the 
determinant bits which contribute to the groups of this 
pattern. Non-determinant bits are not shown. In S2 in 
each bit position the respective bits of Si predominate, 
in S4 those of S5. In S3 the respective bits of Si and S5 
occur with the same frequency, cf. Fig. [TU] 

For a symmetry breaking pattern where n(Si U S2) > 
n(S4 U 6*5), the sign of a determinant component Ri is 
determined by the corresponding determinant bit bi(vi) 
of a node vi in Si, sign Ri = 7*4 (ui) = 26j(«i) — 1. The 
other way round, measuring signi?j we can infer bi(vi). 
If n(SiUS2) < n^US^) we can return to the case above 
by relabeling the groups. The determinant bits bj(ui) of 
the pattern in Fig.[UJare for i = 1, 7, 10 and 1 for i = 11. 

The expectation value Ri for a given pattern is eas- 
ily computed in terms of the expected occupation of 
the different groups n(Sj). For a given j a fraction 
(dia— j+l)/d,M of the occupied nodes contributes Ti(vi), 
and a fraction (J — Vj/du contributes — rj(ui). For a de- 
terminant bit at position i we obtain 



Ri 



d M +l 

E 



-2j+2 



n {G) d M 
j=i 



n(Sj 



n(vi), (15) 



where we have supposed that the fluctuations of n(G) 
are small. If i is a non-determinant bit position, the 
arguments leading to Eq. (115)) do not apply, but following 
a different line we obtain Ri « as explained above. 

For the example shown in Fig. [9] we know from simu- 
lations that a static pattern occurs where all nodes of S2 
are occupied and the others empty, i.e. n(G) — n(S*2). 
This leads to Ri w 0.5rj(ui). 

For the typical case of a symmetry-breaking pattern, 
the Ri allow to identify the determinant bits directly 
and in real time. This procedure is remarkably robust 
against defects in the pattern. For symmetric patterns, 
e.g. n(Si U S2) — n(5*4 U S5) in the above example, 
Eq. (|15p gives Ri = 0, determinant and non-determinant 
bits can not be distinguished this way. 
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V. ARCHITECTURE OF SPECIFIC PATTERNS 

In the previous section we derived the structural prop- 
erties. Here we apply these results to a zoo of specific 

(2) 

patterns observed in simulations on G 12 with [tL,tu] — 
[1, 10]. This restriction allows a certain completeness of 
the overview. For other parameters different patterns 
can be found, but inspection of several examples indi- 
cates that the building principles generally apply. 

We start with the static patterns with d M = 2, 4 and 
6 and point out their common properties. We show how 
the concept of pattern modules needs to be extended to 
explain a more sophisticated static pattern. Finally, we 
describe the complex dynamic pattern which is build of 
modules of dimension dm = 11- For all examples below 
and a few more cases the link matrices are explicitly listed 
in the Supplementary Material [801 ] . 



A. Simple Static Patterns 

The simplest static pattern is the c?m = 2 pattern with 
characteristic 2-clusters, discussed in detail in Sec. IIV Al 
For the ideal pattern Eq. (JTSJ) gives R4 « 1 r»(«i) for both 
determinant bit postions, all other Ri s» 0, which is in 
accordance with simulation. 

In the static regime, for < p 0.03, we frequently 
observed a pattern with 8-clusters. It can be described 
with a pattern module of dimension cLm =4. 

It has five groups, which are illustrated in Fig. Q~T] 
together with their links according to the link matrix, 
Eqs. (|8|9jl . Group S% is highly occupied and its nodes 
form the 8-clusters, cf. Fig. [T3] S3, S4 and S 5 are 
the groups of stable holes and Si is the group of single- 
tons. Note the symmetry of the structure in the figure, 
by which we could swap the groups S2 and S4, and the 
groups Si and S5 . The same symmetry is reflected in the 
binomial coefficients in the formula for the group sizes, 
Eq. ([7]), and can be seen in the link matrix, cf. Eq. (fTTj). 

The number of occupied nodes according to Eq. ([7]) is 
1 52 1 = ( 2 ti) x 2 8 = 1024 and the number of singletons 
(unstable holes) is |Si| = (^J x 2 8 = 256. The group of 
stable holes consists of 3 subgroups, the total number of 
stable holes is given by |S 3 |+|S4|+|S 5 | = (6 + 4 + 1) x2 8 = 
2816. This is in agreement with the observations given 
in Tab. H 



For < p iS 0.03 we find occasionally a d 



m = 6- 

pattern with characteristic 30-clusters. It is rare, but 
once established it remains stable for a long time. For 
the determinant bit positions in the ideal pattern Eq. (fT5|) 
yields Ri ~ 0.33ri(vi) as in simulations. 

Its seven groups and their linking are schematically 
shown in Fig. 1131 Each node in group S3 has six links 
within the group. Thus, a completely occupied £3 is sta- 
ble. There are (?) = 15 nodes in the pattern module 
which belong to S3. These are linked to the correspond- 
ing 15 nodes of the opposite module, thus forming the 30- 




Singletons 



-Cluster 



Stable holes 



FIG. 11. Detailed view of the architecture of the 8-cluster 
pattern. As in Fig. [5] the circle sizes correspond to the group 
sizes. The arrows and the numbers next to them indicate how 
many nodes of a group exert influence on the nodes of another 
group according to the link matrix. 
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FIG. 12. A cluster of eight occupied nodes as found in a 
snapshot of the occupied subgraph in the cLm = 4 architec- 
ture. It has the topology of a cube in three dimensions. The 
cluster consists of nodes in 52, which are all occupied up to 
few defects. The nodes are labelled with their bitstrings. The 
four digits represent the determinant bits. For simplicity and 
readability we give the determinant bits of the architecture 
realization in which Si has determinant bits 0000. B rep- 
resents the string of non-determinant bits, and B its inverse. 
All links have two mismatches. 



cluster. The stable hole groups S4 to S7 are suppressed, 
Si and 52 are singletons. 

These simple static patterns dni = 2,4,6 have in com- 
mon a very regular structure and a mechanism of self- 
support and suppression. There is one fully occupied 
core group with a self-coupling within the threshold win- 
dow [tL,tu] = [1,10]. Its nodes form clusters, sustain 
themselves and suppress all other groups except for the 
singletons, which are surrounded by stable holes only. 
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FIG. 13. The architecture of the 30-cluster pattern. As in 
Fig. [S] the lines show possible links between nodes of the 
groups. 



At p = the group of singletons will be emptied, the 
remaining occupied nodes which are connected to at least 
th occupied nodes survive. An influx p > perturbs the 
pattern and tests its stability. Increasing p further leads 
to more and more defects until, finally, the whole pattern 
will be destabilized. 

All patterns observed in simulations can be well ex- 
plained with the concept of pattern modules. On the 
contrary, not all patterns that can be constructed with 
pattern modules, have been actually observed. For many 
unobserved patterns we can explain why they are either 
forbidden or very rare. 

By the pattern module dj& = 8 we can construct a 
static 112-cluster pattern in the same manner as 2-, 8- 
and 30-cluster patterns. We occupy the core group S4, 
which consists of clusters of size 112. Each node in such a 
cluster has 10 occupied neighbors, all within S4. A small 
perturbation by occupying an additional neighbor desta- 
bilizes the pattern. This is the reason for its rareness. 
For even d,M > 10 the number of links within the core 
group exceeds tjj so that a completely occupied core is 
impossible. 

For odd <1m the self-coupling of the core groups is so 
strong, La ^ 12 > tjj, cf. [80], that static patterns with 
one completely occupied group are excluded. We could 
increase tjj to allow some of these static patterns. 

For our choice of m = 2 in patterns with c?a/ ^ 7 
there are groups, at the end of the tail of the caterpillar, 
outside the range of influence of the core. An occupied 
core is not able to suppress these groups. The range of 
influence could be increased by increasing m. 



B. Static Pattern with Two Modules 

There are static patterns that can not be explained 
with a single module, but by an extension of the con- 
cept to two pattern modules. We explain this with the 
following example. 

In the regime of static patterns < p ;$ 0.035, we often 
find a pattern that is characterized by large star-shaped 
clusters of size 24 and accompanying small clusters of size 
four with one central node surrounded by three separate 
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FIG. 14. A 24-cluster and two 4-clusters as observed in a 
snapshot of the occupied subgraph in the 2-module architec- 
ture with cIm = d' M — 3. The nodes are labeled with their 
bitstrings. The non-determinant bits are subsumed under B 
or its inverse B, respectively. The two groups of three digits 
represent the determinant bits, corresponding to the first and 
second module. For readability the determinant bits are cho- 
sen such that group Si <g> S[ is encoded by 000 000. All links 
have two mismatches. Figure created using yEd [7S| 



nodes attached to it, cf. Fig. [T4l 

It has 6 determinant bits, but the structure can not 
be explained by a single pattern module of dimension 6. 
However, an architecture constructed with two pattern 



modules of dimension cLm 
in full agreement with the observation. 



and d' M with du 



d' 



M 



A node group is now denoted by Si®Sj, the former cor- 
responds to the first module, the latter to the second one. 
The group Si ® S[ has determinant bits bib 2 b 3 6^63^3, 
in the example given in Fig. [I4]we have chosen 000 000. 
The index k of S& has the same meaning as for a single 
module. The determinant bits of group Si ® Sj deviate 
in i — 1 bits from 6162^3 and in j — 1 bits from b[b' 2 b' 3 . 
The deviations in the first set of bits are independent 
from those in the second set of bits. Therefore, not only 
7 groups are generated as for a single module of c?m = 6, 
but we find (d M + 1)K M + 1) = 4 x 4 = 16 groups. The 
relative size of group Si (g) Sj is (f^[) (f*\) . Also the link 
matrix can be calculated. The number of links from a 
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node Vij G Si ® to nodes in 5 r (8 S£ is given by 



Singletons 



Periphery Core 



d—d,M—d M 
I 



(16) 



i - 1 



fc+max(0, — Ay ) / \A;+max(0, A 



<£m — i + 1 



r- 1 



r + 1 



v A:'+niax(0,-A; s )y \fc'+max(0, 
x l(; + 2fc+2fc' + |Ay| + |A;j < m 



where A' 



d', 



M ' 



■ s + 2 in analogy to A^- . This is 



a condensed notation for the four cases discriminated by 
A„,A^0. 

We obtain the ideal 24-clusters if we occupy the groups 
S 2 ® S' 2 for the nodes on the ring and S3 <g> S[ for the 
peripheral nodes. Similarly the small 4-clusters are build 
of occupied groups S4®S[ as the central node and Si®S' 2 
as the attached nodes. In this static pattern more than 
one group is occupied. These groups mutually stimulate 
each other instead of supporting themselves. 

For two modules the center of mass components R4 dif- 
fer not only between determinant and non-determinant 
bit positions, but also between determinant bits belong- 
ing to different modules. For the determinant bit at po- 
sition i Eq. (ITSl) becomes 



Ri 



1 



d M +l d M+ l ,(/) 



■2jW+2 _ 



3=1 31- 



{s 3 ®s' r 



(17) 



where r^v^i) is the position vector component of a node 

in Si ® 5( and the prime in parentheses at d^ and j'W 
only applies if i is a position in the second module. For 
our pattern with c?m = c?m = 3 we observe Ri « 0.29 for 
the first module and iij ~ 0.57 for the second. 

An extension to several modules of possibly different 
dimension appears natural. 



C. Dynamic Pattern 

In simulations starting from the empty base graph for 
p ^ 0.03 we only find a dynamical, stationary pattern 
with complex architecture. It was first observed in simu- 
lations in There, six groups of nodes sharing statis- 
tical properties were identified and the architecture was 
described on a phenomenological base. There are groups 
of singletons, stable holes, two peripheral groups, and 
two core groups. A snapshot of the occupied nodes and 
their linking is given in Fig. [3] 

All structural properties, namely the number and size 
of the groups and their linking, can be explained within 
the concept of pattern modules using pattern modules of 
dnt = 11. This leads to twelve groups, which for a certain 
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FIG. 15. (Color online) Visualization of the 12-group struc- 
ture for 0.03 < p < 0.045 (top) and for p > 0.045 (bottom). 
As in Fig. [8] the lines show possible links between nodes of 
the groups. The coloring of the groups in the upper figure 
corresponds to the color of the respective nodes in Fig. 
according to the qualitative classification. 



range of p can be merged to the six phenomenological 
groups above. Groups Si to are groups of singletons, 
and groups Sg to S12 are stable holes, see Fig. [15] (top) 
and Tab. UTTl 

Table IIIII in the first two rows give the mapping from 
the twelve groups Si to the six groups Sj found empir- 
ically in [13]. The derived group sizes \Si\ are in ex- 
cellent agreement with the measured group sizes. Also, 
the sizes of the subgroups Sg , S9 , S10 , Sii , and S12 
correctly sum up to 1124, which is exactly the statis- 
tically measured number of stable holes, see Tab. I in 
[HI]. Besides the structural information, Tab. IIIII also 
shows group averages of local node characteristics, such 
as mean life time and mean occupation. For the group 
occupations (n(v))si f° r P — 0.028 given in Tab. IIIII we 
can determine R4, for the determinant bits Eq. (|15p and 
direct observation give Ri w 0.29. 

We calculated the link matrix for this pattern, cf. Ta- 
ble IIVI In contrast to the static patterns that emerge for 
low influx p in this structure we also find perfect matches 
and 1-mismatch links, but they are simply outnumbered 
by the 2-mismatch links. 

The structural properties and the mean occupation of 
the groups obtained in simulations allow to understand 
the qualitative behavior. The phenomenological classifi- 
cation in holes, singletons, etc. depends on p. 

For a range 0.03 ~ p ^ 0.045 the following qualitative 
groups appear. There are stable holes as in static pat- 
terns. Singletons are surrounded by stable holes. Core 
groups couple to all groups except for singletons. The 
number of self-couplings is larger than the upper thresh- 
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TABLE III. The node characteristics of the 12-groups structure. Data from 500,000 iterations for p — 0.028. 
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TABLE IV. Linkmatrix for cLm = 11. Missing entries are zero. 
This is an instance for the general structure shown in Fig.fU 
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old tjj- Therefore a complete occupation is not stable. 
However, there are many configurations of partial occu- 
pation. The periphery couples to the core groups and 
to stable holes. It is stimulated by momentarily, during 
the influx occupied stable holes and by the stationary 
occupied core. It is the group with the highest occupa- 
tion. The core groups have more links to occupied groups 
than the periphery and therefore its mean occupation is 
smaller than that of the periphery. 

Although the pattern only evolves for p <J 0.03, the 
connected part of the network, the giant cluster, sur- 
vives if the influx is stopped. Since II = 1, all isolated 
nodes disappear. Starting with a small p ^ 0.03 from the 
giant cluster, leads to a similar scenario as described in 
Sec. MI Al where the giant cluster decays. This shows that 
the dynamic pattern requires a sufficiently high influx 
to emerge and to remain stationary. The influx perma- 
nently tests the stability of this pattern against random 
perturbations. 

Increasing the influx above p ^ 0.045 also the core 
groups are suppressed by a growing population of pe- 
ripheral nodes. The occupation of the core groups thus 
converges to the occupation of the stable holes while in 



the same process the occupation of peripheral groups and 
singletons converges. Thus, based on the same pattern 
module as above we can distinguish considering mean oc- 
cupation and life time only two groups, stable holes and 
singletons, cf. Fig. [15] (bottom). Note, that taking the 
number of occupied neighbors into account we can still 
distinguish groups Se , ■ ■ ■ , 5*9 . 

For even higher p ^ 0.07 the dynamic pattern becomes 
transient. It often breaks down and rebuilds with a differ- 
ent orientation on the base graph. Nodes can no longer 
be permanently assigned to one of the twelve groups, 
but they change their group membership with every new 
formation of the pattern. Thus, a statistical characteri- 
zation of nodes by temporal averages is impossible. 

For still larger p tjj / n — 0.127 the dynamics is com- 
pletely dominated by the random influx. The graph is so 
densely occupied after the influx that typically the upper 
threshold of the window rule is exceeded. 

For du ^ 7 holes at the tail of the caterpillar are 
not sufficiently suppressed by the core, cf. the discus- 
sion at the end of Sec. IV Al and Fig. [51 Suppression by 
the periphery, du JS= 7, and by singletons, du H, is 
required. Otherwise these holes could become occupied, 
which would destabilize the complete pattern. Since sin- 
gletons need sufficient influx, those patterns occur only 
for higher p. Interestingly, among several candidates only 

the dynamic pattern with dm — 11 has been observed on 
(2) 

G 12 for our setting of [t^, tjj] and < p < 0.11. It is also 
the only observed pattern with odd module dimension. 

In this context we recall that the autonomous dynam- 
ics of the idiotypic network was a truly central feature 
of Jerne's original concept The existence of core and 
periphery fits perfectly to the second generation idiotypic 
networks of Varela and Coutinho \& . 



VI. CONCLUSIONS 

We considered a minimalistic model [l3[ of the idio- 
typic network which evolves towards a complex func- 
tional architecture. The main mechanisms are the ran- 
dom influx of new idiotypes and the selection of not 
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sufficiently stimulated idiotypes. Numerical simulations 
have shown that after a transient period a steady state 
with a specific architecture is reached. Generally, groups 
of nodes sharing statistical properties can be identified, 
which are linked in a characteristic way. 

In the present paper we achieved a detailed analytical 
understanding of the building principles of the emerg- 
ing patterns. Modules of remarkable regularity serve as 
building blocks. We can calculate size and connectivity 
of the groups in agreement with numerical simulations. 

The described building principles are formulated gen- 
eralizing regularities found by a careful analysis of the 
simplest pattern. The architectures of all patterns ob- 
served so far can be described by these simple building- 
principles. 

For a suitable parameter setting the network consists 
of a central and a peripheral part, as envisaged in the 
concept of second generation idiotypic networks @ . The 
central part is thought to play an essential role, e.g. in 
the control of autoreactive clones, the peripheral part to 
provide the response to external antigens and to keep a 
localized memory. An ad hoc architecture similar to the 
one described here was used in [8l| to investigate the role 
of the idiotypic network in autoimmunity. 

While the model is clearly immunologically inspired, 
it is worth pointing out that the results are interesting 
in themselves. The process of self-ordering is genuinely 
evolutionary. The local processes of addition and dele- 
tion of nodes involve randomness (random influx) and 
selection (window rule), they have a global impact. On 
a longer time scale we observe an evolution towards a 
complex architecture. Depending on the parameters this 
architecture can consist of many regularly arranged small 
modules or of few large modules covering the complete 
base graph. Most interestingly, for a range of parame- 
ters, we find a dynamic stationary pattern comprising a 
central and a peripheral part. Also evolved real world 
networks like internet [82[ and brain [83| exhibit central 
and peripheral parts. 

The analytical understanding of the principles opens 
the possibility to consider networks of more realistic size 
and to investigate their scaling behavior, e.g. exploiting 
renormalization group techniques, cf. |84j |. 

Future steps will include to check whether a similar un- 
derstanding can be reached for more realistic models. For 
example, we think of matching rules allowing bitstrings 
of different lengths, of links of different weight for vary- 
ing binding affinities, of several degrees of population for 
each idiotype, of hypermutation during cell proliferation 
85], and of a delay of deletion of unstimulated clones. 
We also want to study the changes in the architecture 
during the life time of an individual. 

First results of simulations on base graphs with varying 
link weights and several occupation levels indicate that 
the same building principles apply. Patterns with more 
than one pattern module appear to be more frequent [86[ . 

Ongoing studies investigate the evolution of the net- 
work in presence of self-antigen or an invading foreign 



antigen in terms of whether the network tolerates or re- 
jects them [86ll87t 

In a forthcoming article [88[ we report on a modular 
mean field theory to compute statistical group charac- 
teristics for arbitrary parameters and any given pattern. 
The mean occupation, the mean life time and the mean 
occupation of the neighborhood are in good agreement 
with the simulations. 



Appendix: Scaling of the Relative Group Size with 
Increasing System Size 

Suppose the biological system can express about 10 11 
different genotypes for antibodies Q. This would corre- 
spond to a bitstring length d — 36. The system size in 
simulations is limited by the computer resources. Still, 
today the biological system size is hard to reach. On the 
other hand, the thermodynamic limit is not of interest, 
because the biological system is large but finite. There- 
fore, the scaling of qualitative properties with increasing 
system size is important. 

In the following we consider patterns with odd c?m = 
d— 1. For du ^ 7 we find from the "caterpillar represen- 
tation" , cf. Fig. [51 an architecture of two core groups, 
two peripheral groups, and several groups of stable holes 
and of singletons. With increasing d only the tail of the 
caterpillar grows, i.e. the number of stable hole and sin- 
gleton groups increases. How does the relative size of the 
different qualitative groups scale with d? 

The size of the groups is given by Eq. ([7]) which reduces 
for d/vf = d — 1 to 



15,1 = 2 



d-1 
i-1 



i = l, 



(A.l) 



We can order the groups with increasing i , then the two 
core groups are the two central groups for i = d/2 — 1 
and d/2 which both have the same size. Thus the total 
size of the core groups is 



|C| = 2 2 



d-1 
d/2 



(A.2) 



The peripheral groups are the two groups left of the 
core groups, i = d/2 — 2 and d/2 — 3, which have the 
total size 



\P\ 



d-1 
y d/2 - 2 

Elementary algebra yields 



d-1 
d/2 - 3 



IP]- d ( d - 2 ) |c| 



(A.3) 



(A.4) 



Left to the peripheral groups are the groups of sin- 
gletons, S, and right to the core groups are all groups 
of stable holes, H, cf. Tabic Mil for the example with 
d=12. 
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Because of (£) = (^J we have 

\H\ = \S\ + \P\ (A.5) 

and obviously 

|5| + |P| + |C| + | J ff| = |G ( i|=2 d . (A.6) 
With this knowledge we infer 

\H\ = l/2(\G d \-\C\) (A.7) 

and 

\S\ = \H\-\P\. (A.8) 
For large d we obtain in Stirling approximation 




(A.9) 



Thus, the relative size scales with d as 

^wr^U 1 -^)-^- (A - 10) 



In the thermodynamic limit |C| and \P\ tend to zero. 
For a realistic size of the network, e.g. d = 36, we have 
for the qualitative groups \C\ « 0.266, \P\ w 0.214, \H\ w 
0.367, and |5| « 0.153. |C| and |P| together account for 
a half of all nodes. 

Of course, the parameters p and [tz,,tj/] have to be 
suitably adjusted such that a pattern of the supposed 
architecture actually emerges. 
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